Handling Disagreement in Hate Speech Modelling
نویسندگان
چکیده
Abstract Hate speech annotation for training machine learning models is an inherently ambiguous and subjective task. In this paper, we adopt a perspectivist approach to data annotation, model evaluation hate classification. We first focus on the process argue that it drastically influences final quality. then present three large datasets incorporate annotator disagreement use them train evaluate models. As main point, propose through lens of by applying proper performance measures both annotators’ agreement models’ further poses intrinsic limits achievable When comparing annotators, observed they achieve consistent levels across datasets. reflect upon our results some methodological ethical considerations can stimulate ongoing discussion modelling classification with disagreement.
منابع مشابه
Hate Me, Hate Me Not: Hate Speech Detection on Facebook
While favouring communications and easing information sharing, Social Network Sites are also used to launch harmful campaigns against specific groups and individuals. Cyberbullism, incitement to self-harm practices, sexual predation are just some of the severe effects of massive online offensives. Moreover, attacks can be carried out against groups of victims and can degenerate in physical viol...
متن کاملHate speech, volition, and neurology
In ‘A Hypothetical Neurological Association between Dehumanization and Human RightsAbuse’,1 GailMurrowandRichardMurrowposit a biological explanationof how hate speech can spur violence, not only among individuals but, even, on a societal scale. They elaborate historical examples, cite to neuronal studies on patterns of responses in observation of pain and suffering to explain the dehumanization...
متن کاملChallenges in discriminating profanity from hate speech
In this study we approach the problem of distinguishing general profanity from hate speech in social media, something which has not been widely considered. Using a new dataset annotated specifically for this task, we employ supervised classification along with a set of features that includes n-grams, skip-grams and clustering-based word representations. We apply approaches based on single class...
متن کاملDetecting Hate Speech in Social Media
In this paper we examine methods to detect hate speech in social media, while distinguishing this from general profanity. We aim to establish lexical baselines for this task by applying supervised classification methods using a recently released dataset annotated for this purpose. As features, our system uses character n-grams, word n-grams and word skip-grams. We obtain results of 78% accuracy...
متن کاملAutomatic Detection of Online Jihadist Hate Speech
We have developed a system that automatically detects online jihadist hate speech with over 80% accuracy, by using techniques from Natural Language Processing and Machine Learning. The system is trained on a corpus of 45,000 subversive Twitter messages collected from October 2014 to December 2016. We present a qualitative and quantitative analysis of the jihadist rhetoric in the corpus, examine...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Communications in computer and information science
سال: 2022
ISSN: ['1865-0937', '1865-0929']
DOI: https://doi.org/10.1007/978-3-031-08974-9_54